Privacy-preserving Kruskal-Wallis test

نویسندگان

  • Suxin Guo
  • Sheng Zhong
  • Aidong Zhang
چکیده

Statistical tests are powerful tools for data analysis. Kruskal-Wallis test is a non-parametric statistical test that evaluates whether two or more samples are drawn from the same distribution. It is commonly used in various areas. But sometimes, the use of the method is impeded by privacy issues raised in fields such as biomedical research and clinical data analysis because of the confidential information contained in the data. In this work, we give a privacy-preserving solution for the Kruskal-Wallis test which enables two or more parties to coordinately perform the test on the union of their data without compromising their data privacy. To the best of our knowledge, this is the first work that solves the privacy issues in the use of the Kruskal-Wallis test on distributed data.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Running head: A COMPARISON OF THE EXACT KRUSKAL-WALLIS A comparison of the Exact Kruskal-Wallis Distribution to Asymptotic Approximations for All Sample Sizes

We generated exact probability distributions for sample sizes up to 35 in each of three groups ( 105 N  ) and up to 10 in each of four groups ( 40 N  ). We provided a portion of these exact probability tables and compared the exact distributions to the chi-square, gamma, and beta approximations. The beta approximation was best in terms of the root mean squared error. At specific significance ...

متن کامل

A S A S R Macro for the Multivariate Extension of the Kruskal - Wallis Test Including Multiple Comparisons : Randomization and Z 2 Criteria 1 Warren

In a multi-group experimental design where interest is in a univariate response, the nonparametric Kruskal-Wallis test [Kniskal and Wallis (1952)] provides a potentially more powerful alternative to the parametric one-way analysis of variance when the assumptions of normality are in question. For multivariate response, Puri and Sen (1966) proposed a generalization of the KmskalWallis test that ...

متن کامل

Combinatorics and Statistical Issues Related to the Kruskal-Wallis Statistic

We explore criteria that data must meet in order for the Kruskal-Wallis test to reject the null hypothesis by computing the number of unique ranked 1 data sets in the balanced case where each of the m alternatives has n observations. We show that the Kruskal-Wallis test tends to be conservative in rejecting the null hypothesis, and we offer a correction that improves its performance. We then co...

متن کامل

Power study of anova versus Kruskal-Wallis test

This paper describes the comparison of the anova and the KruskalWallis test by means of the power when violating the assumption about normally distributed populations. The permutation method is used as a simulation method to determine the power of the test. It appears that in the case of asymmetric populations the non-parametric Kruskal-Wallis test performs better than the parametric equivalent...

متن کامل

Nonparametric Evaluation of Quantitative Traits in Population-Based Association Studies when the Genetic Model is Unknown

Statistical association between a single nucleotide polymorphism (SNP) genotype and a quantitative trait in genome-wide association studies is usually assessed using a linear regression model, or, in the case of non-normally distributed trait values, using the Kruskal-Wallis test. While linear regression models assume an additive mode of inheritance via equi-distant genotype scores, Kruskal-Wal...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computer methods and programs in biomedicine

دوره 112 1  شماره 

صفحات  -

تاریخ انتشار 2013